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Abstract 

C**) ■ We present convergence results for an adaptive algorithm to compute free energies, namely 

the adaptive biasing force (ABF) method [5] IB]- The free energy is the effective potential 
associated to a so-called reaction coordinate £(q), where q = (gi, . . . , g3jv) is the position vector 
of a A^-particle system. Computing free energy differences remains an important challenge in 
molecular dynamics due to the presence of meta-stable regions in the potential energy surface. 
Ci The ABF method uses an on-the-fly estimate of the free energy to bias dynamics and overcome 

metastability. Using entropy arguments and logarithmic Sobolev inequalities, previous results 
have shown that the rate of convergence of the ABF method is limited by the metastable 
features of the canonical measures conditioned to being at fixed values of £ [14]. In this paper, 
we present an improvement on the existing results, in the presence of such metastabilities, 
which is a generic case encountered in practice. More precisely, we study the so-called bi- 
^. ■ channel case, where two channels along the reaction coordinate direction exist between an 

\Q ' initial and final state, the channels being separated from each other by a region of very low 

probability. With hypotheses made on 'channel-dependent' conditional measures, we show on 

OQ ■ a bi-channel model that we introduce, that the convergence of the ABF method is in fact 

not limited by metastabilities in directions orthogonal to £ under two crucial assumptions: (i) 

ly-v , exchange between the two channels is possible for some values of £ and (ii) the free energy is a 

^"-) ' good bias in each channel. This theoretical result supports recent numerical experiments [15| . 

where the efficiency of the ABF approach is demonstrated for such a multiple-channel situation. 
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1 Introduction 



Wc consider a system of N particles with positions q G V C R . In statistical physics, one is 



interested in calculating averages with respect to the Boltzmann-Gibbs measure 

dv{q) = Z- l zxv{-pV{q))dq, (1) 

with V : T> — >• R the potential energy function, Z = J v exp(— (3V(q)) dq the partition function, 
T> = {q,V(q) < oo} the configuration space, and j3 = l/(fcsT), where ks is the Boltzmann 
constant and T is the temperature. The function V is the energy associated with the positions 
of the particles and is assumed to be such that Z < oo. The probability measure v represents 
the equilibrium measure sampled by the particles in the canonical ensemble. A typical dynamics 
that can be used to sample this measure through trajectorial averages is the overdamped Langevin 
dynamics 

dQ t = -W(Q t )dt + ^JW^dBt, (2) 



where (B t )t>o is a 3-/V-dimensional standard Brownian motion. Indeed, under loose assumptions 
on V, one has the ergodic property: for any smooth test function tp, 



lim — / ip(Q t )dt= / ipdv. 



T^oo T 

The efficiency of this sampling procedure, which can be shown to be related to the convergence 
rate to equilibrium of the above dynamics is often hindered by metastabilities in the potential 
function V, namely regions of low energy are separated by high energy barriers. Equivalently, 
in terms of the probability measure, v is typically a multimodal measure, with regions of high 
probability separated by regions of low probability. To circumvent this issue, a one-dimensional 
collective variable (or reaction coordinate) £ : T> — >• Ai is introduced, which will be used to define 
a biasing potential for ([2]). In the following, we will assume that |V£| > on T>, and that Ai = T, 
where T = R/Z is the one-dimensional torus (which typically corresponds to the case where the 
reaction coordinate represents a dihedral angle, for example to characterize the conformation of 
a molecule). Before defining more precisely the biased dynamics in the next section, we need to 
introduce a few notation. 

The image of the measure v in £ is given by 

d^(z) = Z _1 exp(-/3A(z)) dz, 

where A is the so-called free energy, defined by 

A(z) = -r l HZz) (3) 

where 

Z z = f cxp(-/3U(g)) S i(q) _ z (dq) (4) 

is the partition function on the submanifold E z = {q <G T> \ £(g) = z}. The measure S^ q ^_ z (dq) is 
defined through the conditioning formula: for any smooth test function ip : T> — > R, 

<p(q)dq= / ip{q)5 aq) ^ z {dq) dz. 

v Jm Jt, z 

Using the co-area formula (see [14[ Appendix A]), one can also identify this measure as S^ q ^ z (dq) = 
|V£| _1 cf(Ts_, where a^ z is the Lebesgue measure on E z . We assume in the following that £ and V 
are such that Z z < oo, for all z £ M.. 

Practitioners are typically interested in free energy differences A(z) — A(zq), which can be 
obtained by computing (and integrating) the derivative A'(z), the so-called mean force 

A\z) = Z- 1 f /(g) exp(-j3V(q)) 6^ q) . z (dq) (5) 

JT, Z 

where / is the local mean force defined by 

f = H?W— dw KWW)- (6) 

The function / can be understood as the negative force projected onto the reaction coordinate, 
plus some correction term related to the curvature of the submanifolds S z . Notice that the mean 
force |[SJ) is in fact a conditional expectation 

A'(z) = E„[/(Q)|£(Q) = z] = f fdv\ z , (7) 



where 

exp(-/3y( g ))^ (9) _,(d g ) 

|z exp(-/3(A(^))) 

denotes the probability measure ^ conditioned to a fixed value z of £(<z). This measure is supported 
on the submanifold E z . For the derivation of ([5])-(|6|)-([7|) which is again based on the co-area 
formula, the reader is referred to [U [TY] [3] ■ 

1.1 The adaptive biasing force method 

The adaptive biasing force (ABF) method [SJ |5] uses an estimate of the mean force A' to bias 
the standard overdamped Langevin dynamics @ in order to overcome metastabilitics in £. The 
bottom line of the approach is that it should be easier to sample the probability measure with 
density proportional to cxp(— j3(V — A o £)) than the original Boltzmann-Gibbs measure v, since 
the marginal probability of the former in £ is a uniform probability measure on T, while the 
marginal of the latter (namely exp(— (3A(z)) dz) is typically multimodal. 
The ABF dynamics is given by 



f dX t = -V{V-A t o$(X t )dt + V2p ZI dB t , 
\A' t (z)=E[f(X t )\t(X t ) = z], 

where A' t is an on-thc-fly estimate of the mean force, which, in view of the definition ([7]) , is expected 
to be a good estimate of A'. The law of X t has density ip(t, •), which satisfies the non-linear Fokkcr- 
Planck equation: 

d t ip = div(V(V - A t o g)ij>) + /3- x A-0, 

/ f i 1 h{q)-z{dl) (Q) 

A>(z) = Ilk . [ ) 

/ i> 5t(q)-z(dq) 

Roughly speaking, the biasing force V(A t o £) "flattens the free-energy barriers in £". To support 
this claim, let us simply indicate that if |V£| is constant, the marginal density in £ satisfies a simple 
heat equation, with zero bias, see JT3J [T3] and also Equation (|T5)) below. Existence and uniqueness 
of solutions to (JU) are studied in [11] and a study of the longtime convergence of (HJ) can be found 
in [14] , the results of which are briefly discussed below. 

1.2 Existing convergence results, and the multiple channel scenario 

It has been shown in [14] that, under appropriate assumptions, the biasing force A' t in ([8]) (actually 
for a slightly different dynamics which reduces to (JE) if |V£| is constant for example) converges to 
the mean force A' exponentially fast in the longtime limit. The rate of convergence was estimated 
as the minimum of (i) the rate at which the law of £(Xt) converges to equilibrium, and (ii) the 
smallest logarithmic Sobolcv inequality constant (LSI constant, discussed in Section I2.3[) of the 
conditional probability measures v\ x , for z G A4. Thanks to the bias in the direction of the 
reaction coordinate, it can be shown that the rate of convergence of the marginal in £ is actually 
not the limiting rate in practice since it satisfies a simple diffusion equation. The real limitation is 
thus the metastable features (i.e. the multimodality) of the family of laws fi z , which is quantified 
through the LSI constants associated to these measures: roughly speaking, the smaller the constant 
the more multimodal the probability measure. These constants may be in some cases extremely 
small, at least for some values of z £ M. . The question we address in this paper is the optimality 
of this theoretical rate of convergence for the ABF method. 

The generic situation is indeed that the LSI constants for the measures isu arc not small 
uniformly in z. Typically, there exists some values of z for which these measures are multimodal. 



This situation is often encountered in practice due to two reasons. First, finding a suitable reaction 
coordinate, namely in our context one that ensures that there are no metastabilities associated 
to the equilibrium measures V\ z , is not trivial for a large-dimensional system. Secondly, a low- 
dimensional reaction coordinate may simply not be sufficient to describe all metastabilities of the 
system. In such cases, the results of P3] predict a very small rate of convergence for the ABF 
dynamics ((8]), and thus, the inefficiency of this biasing procedure. 

As a typical case for which such difficulties appear, we will consider in the following the so- 
called 'multiple channel situation' (sec Figure [1] on the left, for a bi-channel case). In such a 
situation, starting from a metastable state and as the system evolves in the direction of increasing 
values of the reaction coordinate £, it can follow different channels, which are separated (in the 
'orthogonal direction to £') by arbitrary high energy barriers. In other words, the energy landscape 
features parallel valleys which are orthogonal to the isocontours of the reaction coordinate. In such 
a prototypical situation, the conditional probability measures v\ x are indeed multimodal, for the 
values of z corresponding to the system being in one of these channels. 

However, recent numerical experiments [TS] (based on a discretization of the ABF dynamics © 
by multiple walkers simulated in parallel) suggest that in fact, high energy barriers in E z do not 
always hinder the convergence of the ABF method. The multiple walkers are made to follow similar 
dynamics (|8|), but driven by independent Brownian motions. The chemical system considered in |15j 
is the compact states of the deca-alanine peptide (the reaction coordinate £ being thus the end- 
to-end distance of the peptide). Due to some 'buckling effects', this is a typical multiple channel 
situation, since the molecule can shrink to various compact states (see [5]). In J15) . numerical 
results show that the ABF approach indeed yields reliable results in such a situation. We interpret 
this as follows. When encountered with a fork in the channel, each walker is likely to travel down 
a different channel. Thus, it is indeed almost impossible for a given walker to switch from one 
channel to another, once it has entered one of them, but this appears not to be necessary to obtain 
reliable results. It suggests that the theoretical rate of convergence obtained in [Uj is actually not 
optimal. 

Inspired by these numerical results, we present herein an improved theoretical rate of conver- 
gence of the ABF method. The rate will be shown to depend on the LSI constants of the family of 
equilibrium measures conditioned to being in E 2 and a channel. By doing so, we show that high 
energy barriers separating the channels do not in fact affect the rate of convergence of the method. 
The crucial assumptions needed to show our result are: (i) exchange between the two channels 
is possible for some values of £ (see [HI] below) and (ii) the free energy is a 'good bias' in each 
channel (see [H4] below). This is formalized in the main result of this paper, namely Theorem [T] 
below. 

For some technical reasons, we were actually unable to prove this result on the original ABF 
dynamics (|8]). Wc will thus consider a slightly different system (that we call the bi-channel model) 
which retains the most important features of the dynamics ([8]) when applied to a potential exhibit- 
ing two parallel channels in the direction of £, separated by a high energy barrier. 

The paper is organized as follows. In Section [2] we give details of the bi-channel model, define 
some probability measures and recall some entropy definitions before presenting the main result. 
Finally, the proof of the main result is given in Section [3] 

2 The bi-channel model and statement of the main results 

In this section, we present a model to describe the bi-channel scenario. In the following, wc treat 
the case d = 2 (so that the position is q = (x,y)), D = TxK, M = T and £ : T> — > T, where 
£(x, y) = x. We further assume without loss of generality that /3 = 1, which can be done by a 
change of variables i = /3 _1 i, ip(i 7 x) = i/j(t,x), V(x,y) = /3V(x, y). With these assumptions, some 
notation may be simplified: |V£| = 1, £ z = R and 5^f q \- z (dq) = dy. Furthermore, the definition 



of the local mean force in ([6]) simplifies to 

f = d x V 
and the free energy and its derivative are given by 

/ d x V{x,y) cxp(-V(x,y)) dy 
A(x) = -In / exp(-V(a:,tf)) dy and A'(x) = il . (10) 



exp(-V(x,y)) dy 

We would like to emphasize that the choice of the domain Ai = T and reaction coordinate 
£(x, y) = a; is merely to reduce technicalities, see [14] for appropriate tools to treat general £. 
In particular, the results can be straightforwardly generalized to the case T> = T x R d ^ 1 and 
£(<?i, . . . ,qd) = <?i- Likewise, the generalization to a situation with multiple channels (more than 
two) is straightforward (see Remark Q] below for another generalization). 

2.1 The bi-channel model 

The bi-channel situation is characterized by the existence of two channels joining an initial and 
final state on a potential energy surface V, separated from each other by a region of high energy, as 
depicted in the left of Figure [TJ As explained above, we were not able to analyze the original ABF 
dynamics ([8} in this situation, because of some technical difficulties in expressing the probability 
density flux from one channel to the other. 

We therefore analyze the convergence of the ABF method for a slightly different model, which 
is schematically represented in right of Figure Q] Each channel is described by a potential energy 
function Vi : T> — > R, where i <E {0, 1} denotes the channel index. The stochastic process we 
consider now is actually a couple {X t ,It), where the position vector X t lives at time t on the 
potential Vi tl It € {0, 1} being the index of the visited channel at time t. The channel index I t is 
allowed to switch to 1 — It only if £(A t ) lies in some designated regions (typically at the two ends 
of the two channels) . 

The dynamics for the ABF dynamics in the bi-channel model is then 

dX t = -V(V h - A t o £)(X t )dt + V2 dB u 
A' t (x)=E[d x V It (X t )\Z(X t ) = x], 
It £ {0, 1} is a jump process with generator 

Lip(x, y, i) = -X(x)((f(x, y, i) - <p(x, y, 1 - i)). 

In terms of the stochastic process It , switching between the two potentials (namely change of It 
to 1 — I t ) occurs at times 



Tn+l 



ini< t> T n 



\(S(X a ))ds > T n 



where tq = and T n are i.i.d. random variables drawn from the exponential distribution with 
parameter 1. In this way \(x) denotes the rate at which the trajectories jump from potential Vi 
to potential Vi_j. Note that this rate depends only on the position, x, in the reaction coordinate. 
The bi-channel feature of the model is related to the fact that we assume that the rate A is zero 
(there is no switching) in some region of the reaction coordinate. Outside of this region, the rate 
is supposed to be constant, and the potential functions are assumed to be identical (the particles 
live in the same potential). Let us state this as a formal assumption, 

[HI] 3£cT, X(x) = Xl T \e{x) and Vx e T\£,V (x,-) =Vi(x, •)■ 





Figure 1: Left: Contour plot of a 2-dimensional potential energy surface demonstrating the bi- 
channcl scenario. Right: In the bi-channel model the two channels are described by two potential 
functions Vi : T> — > R, i G {0, 1}. Exchange between the two channels is allowed only in regions 
T\£ at a rate A > 0. 



The region £ C T in the above hypothesis represents the region where the two channels are 
separated by high energy barriers, see Figure [1] It is assumed to have a Lebesgue measure different 
from and 1. 

The main qualitative difference between the bi-channel model we study, and the original ABF 
dynamics (JSJ is that the switching only depends on the x-position and not on the y-position. 
However, the ABF dynamics (TTTj) conserves the main difficulty of the original one, namely the 
metastability of the dynamics in terms of visited channels for some values of £. At times t such 
that £(Xt) G £ , It cannot switch to 1 — It . In particular, it can be checked that the proof of [33] 
applied to (JTTJ) in the case £ = leads to an estimated rate of convergence limited by A and is 
thus eventually zero if A goes to zero (see Remark [3] below) . The aim of this work is to study the 
case £ ^ and to obtain an exponential rate of convergence even if A = in some region (see 
Theorem Q] below) . 

Remark 1 On Figure [7J we represent the bi-channel case with two metastable states linked by 
two different channels. The region where the stochastic process can jump from, one channel to 
another has two connected components. We would like to emphasize that our result also applies 
to the case where this region is simply connected, namely a situation where two channels start 
from a metastable state (along the reaction coordinate value) but do not end in another metastable 
state. This situation is of course less favorable in terms of speed of convergence to eguilibrium, 
which would be reflected in our theoretical result through the parameter 6 (see [H4] below). This 
is actually typically the situation of the numerical experiments in \15f mentioned above since the 
various compact states obtained do not belong to the same metastable basin. 



2.2 A partial differential equation formulation 

Let us introduce the time marginal of the process (X t , It) in (TTTJ) : 



dfH = X! ^(*> x > V-> i)fo dx dy, 



i=0 



where 5, is the Dirac measure on the singleton {i}. When necessary, we shall denote the i- 
dependency of the density by a subscript ipi, so that ipi—i denotes ip(t, x,y,l — i) for example. The 



evolution of the densities are described by a system of non-linear partial differential equations: 

V*e{0,i}, 

d*il> = div(V(Vi - A t o 0^) + Ai/> - A o £ [4, - ^_i] on V, (12) 

where, we recall, the last term is zero for £(x,y) £ E. The non-linearity is due to the definition of 
the mean force estimate, given by 

V / d x Vi(x,y) tp(t,x,y,i) dy 
A' t (x) = i=^- 1 . (13) 

Y] / ip(t,x,y,i) dy 

*=o Jr 

Using hypothesis [HI], it can be checked that if tpoo is defined as a probability density proportional 
to e - ( yi ~ Aoo °£) where ^oo is a given long-time limit for A t , then ipoo is a stationary solution to (fT2|) . 
Then, by replacing ijj by i/'oo in (|13|) and comparing with the definition of the mean force (|10[) , it is 
clear that by choosing A' x = A 1 , one obtains a stationary solution of the system (fl"2"j) -([T3 | written 
as: 

e -(V t (x,y)-A(x)) 

ipoo{x, y, i) = — . (14) 

J2 I e~^- A °V dx dy 
i=o Jv 
The associated equilibrium measure for the process (X t ,It) writes: 

1 
dfioo = *^2ipoo(x,y,i)5i dx dy. 

We will need further notation for marginal and conditional laws associated to ^ and fioo. The 
next two sections give precise definitions for these measures, and Table Q] summarizes the notation. 

2.2.1 Marginal laws 

We are now in the position to define marginal probability densities. The image of the probability 
measure fit in £ and / is denoted by 

d[it = y ijr' (t,x,i)Sidx where ip (t,x,i)= / ^(t^x^y, 1) dy. 
*=o J * 

The evolution of the density %j. i*' 1 is described by the (non-closed) partial differential equation 

d t ^ J = I d x (d x (Vi - A t o £)V0 dy + d xx ^ J - \(x)(4>^ - ^), (15) 

Js. 

obtained by integrating (j!2[) in y. The associated equilibrium measure is 

^ = ^2'tpii I (x,i)5idx where ip^(x,i)= / 4>oo(x, y,i) dy. 

i=0 

The marginal measure in £ only is denoted by 

1 1 f 

dfif = i/>* (t, x) dx where ip* (t, x) = 2, y' (^:X,i) = y / ^(f, x, y, i) dy. 

?:=o i=o J R 



Measure 


Definition 


Description of measure 


dfi t 


1 
^2<i/j(t,x,y,i)8i dx dy 

i=0 


Probability measure in domain T> x {0, 1} 


duf 1 


1 

y i/)£' (t,x,i)Sidx 

i=0 

tl>^(t, x) dx 


Marginal measure in reaction coordinate and channel 


d(4 


Marginal measure in reaction coordinate 


dHt\x,i 


ip(t,x,y,i) dy 
ipt> J (t,x,i) 

1 
^2ip(t,x,y,i)5 l dy 


Measure conditioned to being at x and i 


dnt\x 


i=0 

^(t,x) 
l 


Measure conditioned to being at x 


M\ x 


i=0 

i>S(t,x) 


Marginal in /, conditioned to being at x 



Table 1: A list of probability measures. 

By summing (fT"5|) over i and using the definition (fT3|) of A' t , it is easy to check that xjfi satisfies 
a closed, very simple partial differential equation (this is similar to the original ABF dynamics, 
see dUCII]): 

dt^ = d xx ^ onT. (16) 

Thanks to the adaptive bias, along £, the barriers have been flattened. The long-time limit of t/>* 
is given by 



i>L(x) = ^2 ^oo(x, y,i) dy = 1, 



which corresponds to the uniform probability measure on the torus T. 



2.2.2 Conditional laws 

Let us introduce the measure fi t i x of (X t ,I t ) conditioned to being at a specified point x in the 

reaction coordinate: 

l 



d\i t \x = 



Its long-time limit is 



^Moolx 



^2tp(t,x,y,i)6i dy 

i=0 

ilfifax) 

1 

^2^oo(x,y,i)6 l dy 
2=o 

ipL(x) 



(17) 



(18) 



To study the bi-channel model, we will also need to introduce the measures conditioned to being 
at fixed £ and a particular channel i £ {0,1}. This measure and its long-time limit are defined 



respectively by 

i/)(t,x,y,i) dy ^oo(x, y,i) dy 

d ^= WJt^J) and dti -^ = Wm ■ (19) 

Finally, we will also need the marginals in I of the probability measures u t i x and yLoo\x- These 
Bernoulli probability measures u t , and fi^, represent the proportion of the marginal distribution 

ip£(t, •) and ip£ in each channel. They are formally defined as 

1 1 

5> e,/ (t,M)*< J2^( x ^ Si 

&\A.\x = — — 7771 — \ an d da. L.i_ = — 7 . (20) 

2.3 Entropy and Fisher information 

In this section we recall some well-known results for defining relative entropy between two prob- 
ability measures, which can be seen as a measure of the 'distance' between those. A general 
introduction to this topic can be found in [TJ I18| and applications to study the longtime behavior 
of Fokkcr-Planck type equations are presented in [5] . 

Definition 1 (Entropy). For any two probability measures a and v such that a is absolutely 
continuous with respect to v (denoted as a -C v), the relative entropy is defined as 



H{u\v) = Jh,(^jdu. 



The positivity of the relative entropy can be shown using the inequality ln(l/x) > 1 — x. Further- 
more, H{u\v) = if and only if u = v. 

Definition 2 (Csiszar-Kullback inequality) . For measures a and v which have densities 
with respect to the Lebesgue measure, the following holds 

\\H-v\\ L i<y/2H(jJ,\ V ). 

This allows us to control the Z^-norm of the difference of two probability measures by their relative 
entropy. 

Definition 3 (Fisher information). For any probability measure fi absolutely continuous with 
respect to v, the Fisher information is given by 



F(u\v) 



1^ 
av 



d/j. 



Definition 4 (Logarithmic Sobolev inequality) . The probability measure v is said to satisfy a 
logarithmic Sobolev inequality with constant p > (in short: LSI(p)) if for all probability measures \i 
such that \x -C v, 

H{u\v) < ±-F{u\v). 
2p 

Such an inequality holds for Gaussian measures [7] for example, and more generally [3] for any 
measure v with density proportional to e~ v , where V is a-convex (in which case the LSI constant 
p is equal to a). Besides, there exists a perturbation result [10]: if v is a probability measure such 
that dv J dv = e , where v satisfies a LSI(p) and U is a bounded function, then v satisfies a LSI with 
constant p = per' OBC ^- u \ i where osc(C) = sup(f7) — inf(CT). Thus, a very large class of probability 
measures satisfy a LSI. An important feature of the LSI constant is that it degenerates to zero in 
the case where the probability measure is multimodal. For example, if dv = Z _1 cxp(— j3W{x)) dx 



and W(x) = x 4 /4 — x 2 /2 is the double-well potential in dimension 1, then the LSI constant scales 
as exp(— (3 AW) where AW = W(Q) — W(l) > is the height of the barrier. Such inequalities thus 
hold under rather loose assumptions, but the constant p is very small for a multimodal measure. 
For example, in typical situations encountered in molecular dynamics, the LSI constant for the 
measure v defined in (fT]) is extremely small, which is related to the fact that it is difficult to sample 
directly this measure. 

To analyze the convergence of the ABF dynamics for the bi-channel model, we make use of 
LSIs for the conditional measures Poo\x,i (see assumption [H3] below), which are the equilibrium 
canonical measures, conditioned to being at a fixed value of £ and in a channel. In [14] . log-Sobolcv 
inequalities for the equilibrium canonical measures conditioned only to being at a fixed value of £ 
were considered, but in the bi-channel case, those are typically very small due to the presence of 
high energy barriers 'orthogonal' to the isocontours of the reaction coordinate (see again Figure [T]). 

Let us now define the Wasserstein distance between two probability measures. 

Definition 5 (Wasserstein distance). The Wasserstein distance with linear cost between prob- 
ability measures p and v is defined as 



W(u,u) = inf / \y-y'\ n{dy,dy'), 

where H(p, v) denotes the set of coupling probability measures on T> x T>, with marginals p and v . 

Definition 6 (Talagrand inequality). The probability measure v is said to satisfy a Talagrand 
ineguality with constant p > (or T(p)) if for all probability measures p such that p <C v, 



W{p,v)<^hi{p\v). (21) 

Logarithmic Sobolev inequalities and Talagrand inequalities are related (see [IB]): 
Lemma 1 If is satisfies LSI(p), then v satisfies T(p). 

Below, we present entropies that prove useful in obtaining convergence results of the bi-channel 
ABF model. In this paper, we are primarily interested in the convergence to a stationary state of 
the Fokker-Planck equation (fP2|) — (fT3)) . and thus of the associated partial differential equations (p~5]) 
and (|16l) . Relative entropies will therefore be defined for some probability measures with respect 
to their long-time limits. 

The total entropy will be denoted by 

E(t) = h(jh\hoo) = izj ln [if) $ dx d y- 



The so-called macroscopic entropy is defined as 

E M (t) = H(^ t \ul) ■■ 



M£)** 



4 
the local entropy at a fixed value x in the reaction coordinate by 



e m (t, x) = H"G«t|xl/"oo|*) = y~] / In 



^1 iPl)^ dV ' 



and the microscopic entropy by 



E m (t) = / e m (t,x) w{t,x) dx. 

Jt 



10 



With the above definitions, it is easy to show that 

E(t) = E M (t) + E m (t). 
In order to treat the bi-channel case, we define a channel-local entropy, defined by 



(22) 



e c \(t,X,i) = ff(Mt|jr,i|Moo|a!,i) 



hi 



^'7 v47 & 



7 dy. 



Two hypothesis that will be essential to the results presented below are: a so-called 'bounded 
coupling' assumption on the cross derivative d x , y Vi (see 0112), and an assumption on the loga- 
rithmic Sobolev inequality constants for the probability measures jUoo|a:,j' The hypotheses read 



Vi <G {0, 1}, Vi and £ are sufficiently differcntiable functions such that 3 C, M > 



[H2] 



|3*,„V-|| 



L°°(TxI 



< M < oo and 



J R dxVie Vi dy 



m 



dy 



<C*<oo, 



L°°(T) 



[H3] 



Vi e {0, 1}, Vi and £ are such that 3p > 0, Vx e T, Vi e {0, f }, 
the conditional measure fioo\ x .i satisfies LSI(p). 



The hypothesis [H3] gives us that Vx e T, Vi £ {0, 1} 



2p 



or equivalcntly 



In 






4? J^ dW -2^ 



( £ 



■</' 



i/j« 



7 */• 



(23) 



Finally, an entropy that appears in later calculations, is that of the Bernoulli measure \jL with 
respect to its long-time limit 



e c (t,x) = #(a4|JmL|J = H ln 



i=0 



& / 4 



v>« 



The so-called channel entropy is then given by 

E c (t) = / e c (t,x)il>* dx 



i=0 



£/* 



</> 



Z-l 






V' ? / ip: 







«,/ 



(24) 



2.4 The free energy as a bias in each channel 

As well as the hypothesis [H3] on the conditional measures /Uoolx.i) & n assumption will also be 
necessary to ensure that the free energy is a 'good bias' in each channel. More precisely, once the 
bias has converged, the marginals along £ in each channel must converge sufficiently quickly to 
their long-time limit. Roughly speaking, the channels should not be too 'asymmetrical'. The aim 
of this section is to state this more formally, see assumption [H4] below. 

Consider that the system is already nearly at equilibrium, in the sense that 



i) A' t = A^ 



A' 



ff 



ii) Vi G {0, 1}, / d x Vi dnt\ x ,i = / d x Vi dp^^. 
Then the marginal density ifj^' 1 can be shown to satisfy (sec (fTS"|) and (|40|) below) 

a t ^' J = -U^' 1 = 9, (^fitC^V^ 1 )) ~ A(x)(^' J - Vfi). (25) 

It can be shown that the operator L = (i/o,!^) is symmetric and positive definite with respect 

1 f 1 
to the inner product (/, g) = V, / fi(x)gi(x) —rj dx and has a spectral gap > (see 

Lemma [12] below) . In other words, L is such that for all functions / : T x {0, 1} — > K, with 
•^ G ffl ( ^4.) ^) > / ^ ° and Sl=o /t /i( x ) ^ = , we have 

(/,/}< J(/,L/>. (26) 

As will be apparent in the proof, it will be necessary for the spectral gap 9 to be sufficiently 
large. 



[H4] 



> 6> m in with 9 min — 

c 

where < c, M < oo are such that inf V>|i = c and su Px V^(0, %) = A/". 



We recall that the constants C, Af and p have been introduced in assumptions [H2] and [H3] 
above. The fact that c > is not restrictive since Vi £ {0,1}, Vi is a continuous function and 
*Poo c* /k e ~ dy > is continuous and defined on the compact space T. Similarly, since i/j^ 
satisfies the heat equation (|16p . the assumption M < oo is not restrictive. If, for example, the 
initial condition has a Dirac mass marginal in £, for any positive time to > 0, ip^{to, ■) is a bounded 
function, and one has simply to consider the dynamics on [to, oo). 

Remark 2 In the hypothesis [H2], the assumption on the cross derivative d XtV Vi could in fact be 
replaced by ||9: E ^||l oo (TxR) < M < oo, in which case, in [H4J, the minimum value for 9 would be 

changed to 9 mln = . ' and the Talagrand inequalities in LemmasX(h and [Jl\ would be replaced by 
Csiszar-Kullback inequalities. 

2.5 Main result 

We are now in position to present the main result of the paper. 

Theorem 1 Assume hypotheses [H1]-[H4]- There exists a smooth function A : (# m j n ,oo) — > (0,p) 
which is increasing and such that: 

A(p + 20 min ) = P - and A(0) -► ( ° US * ~* ^ min 
2 [ p as a —¥ oo 

for which we can prove the following convergence result: for any e G (0, A(0)), there exists a 
constant K > such that, Vi > 0, 

E m (t) < Kexp(-2 min{(A(6») -e),47r 2 }i) . (27) 

This implies that the total entropy E and thus \\ip(t, ■) — V'oollli converge exponentially fast to 
zero with the same rate. Furthermore, the biasing force A' t converges to the mean force A' in the 
following sense: Vi > 0, 

\A' t {x) - A'{x)[ 2 ^{t, x) dx < 2(C + Mp- 1 ' 2 ) 2 E m (t). 
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As a consequence, for any positive time to > and e £ (0, A(0)), there exists a constant K such 
that Vi > to, 

\A' t (x) - A' (x)\ 2 dx < Kcxp(-2 min{(A(0) - e)An 2 } t) . 



The term Air 2 corresponds to the exponential rate of convergence of i/j* to i/j^ (see (USD; and is 
clearly not the bottleneck. There are actually various ways to make this rate as small as needed 
(see Remark 11 in [14]). 

Thus, this result essentially shows that the ABF method converges at a rate which is limited 
by the multimodality of the equilibrium canonical measures conditioned to being at fixed value of 
£ and in a channel (quantified by the constant p) , if the free energy is a bias which enables a fast 
exploration of each channel (this is quantified by the constant 9, which should be sufficiently large 
for A(6) to be indeed close to p). Thus, the convergence may be fast even if switching between 
the two channels is impossible for some values of £. If the spectral gap 9 is sufficiently large, we 
thus recover a similar expression for the rate of convergence of the ABF method as the one derived 
in |14j . with p being now the LSI constant of the canonical measures /iuj. 

We would like to emphasize that our arguments hold under the following assumption of existence 
of regular solutions: We assume that we arc given a process (X t ,It) and a function A' t which 
satisfy (|11|) . and such that X t , conditionally on It = i has a smooth density xp(t,x,y,i). We 
suppose that this density is sufficiently regular so that the entropy estimates below are valid. We 
refer for example to [2] for an appropriate functional framework in which such entropy estimates 
hold. 

3 Proof of main result 

In order to prove the exponential decay of the microscopic entropy in Theorem [TJ we use the 
fact that the time evolution of the microscopic entropy can be expressed as a combination of the 
evolution of the total and the marginal entropies, from (|22[) 



dE m _ dE dE M 
dt dt dt 

In order to obtain results for the microscopic entropy, we begin by treating the time evolutions of 
the total entropy and the macroscopic entropy separately. 

3.1 Preliminary computations on the total entropy E 

In order to study the evolution of the total entropy, some auxiliary results will be needed and are 
given in the lemmas below. First, it will be useful to write the Fokkcr-Planck equation associated 
to ip in a different form. 



Lemma 2 The Fokker-Planck equation (I12|) for tp can be rewritten as 

d t i> = div(V>ooV(V>M>o)) + dx{{A' - A' t )i>) - A(z)(V - ^l-i)- 
Proof : By developing the right hand side, we obtain 

W = div('vV'-^VVoo)+5 x (( J 4 , -4)V)-A(x)(V-'0i- i ) 

= div (W + V(y, - A o £)V) + 8 x (A'i>) - d x (A' t ^) - \(x)(J> - Vi-i) 
= div (V(Vi - A t o OV) + AV; - A(x)(V> - Vi-i), 

which is indeed the Fokker-Planck equation (fT2|) . ^> 

The above may now be used to estimate the time evolution of the total entropy. 
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Lemma 3 The total entropy E satisfies 



dE ^ 



?:=o 



Txl 



Vln 



A 

Voc 



v-E/ (a'-^r 



In 



Voc 



Proof : First, by definition of the total entropy, we have 



dE 



4> 



- E/ mffW 

using the fact that -0 is a probability density. Next, we use Lemma [U to obtain 



V- 



(28) 



(29) 



dE 
~dt 



E 



i=0 



div VooV . 

Txl V W. 



V; 



In 



Vo 



E/ ^((A'-4)0)ln 

• n JTxR 



Voc 



E / A ^) ln 

i=0 JTxR 



A 

Voc 



(V- Vi-0 



(30) 



< - 



E 

i=0 



Txl 



Vln 



A 

Voc 



.-_n JTxl 



In 



j0_ 

V'oc 



0, 



where the last line is a result of integration by parts and the fact that (|30|) is non-positive, which 
is proved below in Lemma |4j <0> 

Lemma 4 For ip satisfying (jl 2 j) and -ip^ its long-time limit, the following holds 

-E/ A(x) ln ( -$-) (V - Vi-i) < 0. 

j_q JTxK Woo/ 

Proof : First, recall that A(x) = 0, Vx £ £. We consider therefore the left hand side of the 
inequality at hxcd (x,y) £ (T\£) X R. At fixed (x,y), we consider renormalizcd (Bernoulli) 
probabilities denoted by ipi = tpi/fyo + ipi), so that ipo +ipi = 1. 



-E 1 * 



i=0 



__1 

V'oc 



(ip-ipl-i) 



In 



Vo 



(/' 



oo.O 



(-Vo + Vl) 

(Vo + Vi)(-Vo + Vi) 

(Vo + Vi) 



111 



Voo,l 



111 



00 



Voo,0 



In 



01 



Voo,l 



-hi(^)Vo-ln(^]Vi+ln[^]Vi+ln[^|Vo 



Voo,C 







oo,l 



Voo.C 



Voo,l 



(Vo + Vl) 
(Vo + Vl) 



-21n | J*- 1 Vo - 21n [ J^- ] Vi + (Vo + Vi)ha ( -p- ) + (Vo + Vi)ln 



n 







oo,0 



0j 



k Voo,l y 

Vo 



v. 



oo,0 



Voo,l 



-2Vln 41^ + ln, 

~^ \Voo/ V^ 00 ' , 



ln 



01 



.V 



oo.l 
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From hypothesis [HI], V(x, y) £ (TT\£) x R, V'oo.i = V-'oo.o == 5) which allows the above to be written 
as 

^lnU^ (V>-^_i) = (^0+^1 

= -2(Vo+v>i) [y>[i ) ^+y]inffc]vi 00 l (31) 

0. 



i=0 






The last line is due to the fact that the two terms between brackets are non-negative, since they 
are relative entropies. <0 

Remark 3 Let us consider the case where £ = and thus Wx G T, X(x) = A > 0, which implies 
(see [HI]) Vq — V\ and therefore V'oo.o = V'oo.i everywhere on T x R. In this case, it follows 
from (|31[) that 

-W A(x)ln(-^-)ty-^i-i) 

i=0 JTxR Woo/ 

<-2A/ (VH)+^i)y"ln(i]^ 

JTxR^ VVW JtxR VVoo,0 + V'oo,1/ 

Furthermore, since the marginal i/j x ' y :— tJjq + ipi satisfies in this specific case 

d t ^y = dxvUJ°?vw v /1& v )) + d x ((A' - A' t )r-y), (32) 

one can show, using the results of \11$ , that 3C > 0, Vi > 0, 

- V I A(x)ln [-j-] {$ - ih.-i) < -2XE + 2ACe" 2 ™MM* 2 }*. 

i=0 "'TxR VV'oo/ 

Using this, and the fact that —E< —E m , one can show that E m converges to zero exponentially 
fast with rate 

2min{p, 47T 2 , A}. 

The convergence rate thus depends on A > 0, the rate at which switching occurs between the two 
channels. This is comparable to the original result obtained for the ABF algorithm, see [Ufl . The 
aim of what follows is to obtain a result in the case where £ ^= 0, namely when A = in some 
region. 



In order to estimate the last term on the right hand side of (|28jl . it will be helpful to express the 
difference of the biasing force and the mean force in terms of densities 

Lemma 5 The difference between the biasing force A' t and the mean force A' can be expressed in 
the following way 



4--4'=£/ E «»(£)!*-*>» 



i/jL 



Proof : We develop the expression on the right hand side and use the fact that ip^ = 1 
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g/,*K£)$^-*KI: 



= E / 9M^)i dy~J2 [ OMMji dy - d x ]n (</>«) 
t=0 -^ R ™ j=0 ^ R ^ 



j=0 *" K r 4=0 

The last line is a result of the definition of A' t in (|13p and the fact that A is a function of x only. <0> 
Another useful estimate for the difference between A' t and A' is given in the following lemma. 

Lemma 6 The difference of the biasing force and the mean force can be bounded by the microscopic 
entropy as: 



f 14-A'lV dx<2R 2 E m (t), 



where 

R= (c + Mp- 1/2 



Proof : We begin by showing that |AJ(x) — A'(x)\ < My / 2e m (t,x)/p. By definition, we have 

V] / d x Vii> dy y2 dxVi^oc dy 

, nil ■ r. JS. 



A' t (x)-A'(x) 



E ip dy ^2 ipoo dy 



i=0 " M i=0 ' 

1 



where ir(dy,dy') is any coupling measure on R x M. with marginals /Ujuj and /WooUj. Next, using 
Taylor's expansion on c^V^x, y), we have 



A' t (x)-A'(x) = J2 f ((d^iix^ + d^ViixM^y'My-y'^^-^V^y'^^Tridy^y') 
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where rj(y,y') G [y,y']- Recall from [H2] that 3C,M > such that 



d x Vi dfj,^ 



< C 



L°°(T) 

and ||9 x .j,V^||icx, < M. Furthermore, with the use of the Csiszar-Kullback inequality, we have 



\A' t (x)-A'(x)\ < C^ 



i=0 



^ 1% 



^ i>L 



M E^r/ \v-y'Hd v ,dy') 



< 



i=0 
1 j,€J 



?/>«• 



c«i^L|J + ^Eir / \y-y'Hdy,dy') 



i=0 



ipi 



Next, by the Talagrand inequality pip and the concavity of the function x i— > \/a;, 



^V«' 7 /2 



14(a) -^'(aOl < c v /2F(^,Ai^|J + M^^ r y-fro* tM>/ i 00 | 1Bil .) 



< CJ2H(n t \ x ,ii 00 \ x )+M A 



\ ~ / . n \^t\x,ii ^oo\x,i 



v/ 



< Cy2H'(^ t | x ,/x 00 | x ) +MJ-H(nq x ,(i 00 \ x ) 



Therefore 



1 4(a) - A'(a:)| 2 < 2 (c + M P - 1 ' 2 ) e m (t, x). 

The result follows immediately, since E m {t) = J r e m (t,x) ip^dx. <0> 

We are now equipped with the right tools to control dE/dt, and are left to handle the evolution of 
the macroscopic entropy. 

3.2 Controlling E M 

Due to the free diffusion equation ()16j) satisfied by ip^, the macroscopic entropy Em is easily 
controlled. 



Lemma 7 The macroscopic entropy satisfies 



dE^j 
dt 



d x \n 






V/ = -F(^|^). 



(33) 



Proof : Using (|16j) and integration by parts 



dt 


dt J T 


'"©* S 




= f In f^-) d xx ^ 

•If \1pooJ 




■ -/ 


W4) 



v> 5 . 



Lemma 8 The Fisher information for the marginal density ip^ decays exponentially fast with rate 
r = 8ir 2 

F(^(t,-)|^)<^cxp(-87r 2 t), 

where F = FOj^O,-)^). 
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Proof : See Lemma 12 of reference [14] • 







In light of the lemmas presented above, hypotheses [H1]-[H4] may now be used to control the 
evolution of the microscopic entropy. 



3.3 Controlling E m 

The aim of this section is to obtain an estimation on the evolution of th microscopic entropy E m 
see (|36|) below. We first begin by using the fact that the channel-local conditional measures n^x,. 
satisfy LSI(p) to bound the microscopic entropy. 

Lemma 9 Under the hypothesis [US], the microscopic entropy E m satisfies 



«.*££ 



i=0 



Txl 



dy\n 



■t 



ip dx dy + E c 



where E c is defined in 

Proof : Wc fist consider the local entropy e m (t, x), which can be decomposed into the sum of the 

channel- local entropy and the entropy of the Bernoulli measures. 



e m (t,x) = flX/Uilxl/UooIx) 



i=0 

i 

- E 

i=0 

1 

< E 

i=0 



In 



1p /V^o\ _0_ 
1p I "0oo \ 1p 



^IMWfJw* 



i' u ' </4V i A 



dy 



"£/£)** 



1 



cUn 

' ip 



4' 






dy 



1 r 



E 

i=0 



In 



^ ,tfj\ ^ 



1* I i>lo) 1* 



-t 



cUn 



■t 



Ip J , 

— r dy + e c {t,x), 



where the inequality is a direct result of [H3]. The microscopic entropy is then 
E m {t) = \ e m (t, x)-0 ? dx 



< 



-t 

On *—> 



2p ~o jT 



dyln 






ip dx dy + E c , 



as required. {> 

The time evolution of E m may now be expressed using results of Lemmas [3] and [7J 
dE m dE dE M 



dt 



dt dt 

l 



< - 



E 



i=0 



-n^Tx 



Vln 



</-' 






i/' 4 
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We may now apply Lemma [5] and use integration by parts 
dE m 



dt 



; _n JTxR Woo/ ; _ n JTxI 



a^in ( — — 

V'oo 



</' 



ELjE/^Mf:)^-^ 



0oo/ V^ 



0« 



V> 



<9 x ki | j— ) V 



/ 


9,ln ( 




2 


l=0 AxK 


<9ylnf 


l/'oo/ 


1 

E 

7— n 


/ 


d x ln( 





^ 



V" 



jt(sM£H'* 



zLMih^)* 



i=0 



0c 



<9 x ln ( -y- 

Voo 



(/'" 



(34) 



Notice that (|34|) is non-positive by the Cauchy- Schwartz inequality. We therefore have 



dErr, 

~dT 



< - 



V'o 



4=0 

E/ (htKMt 1 ) 
S^xi vv4/ Wo- ' 

1 

E 



v 



cL In 



^oo 



V> 4 



Txl 



( £ 



v- 



(M 



<9 x ln ( — g- ) t/i 



(4 -A') 



< -2p£ m + 2pE c + J [ \A' t - A'f W f d x \n (^f\ 



ipi. 



(35) 



Line Q35p is a result of Lemma[5]and the last inequality is due to LemmalHland a further application 
of the Cauchy-Schwartz inequality. Now, using Lemmas [5] and El we obtain 

dE„ 



dt 



< -2pE m + 2pE c + R^2E^^J F e~^ 2 



where we recall Fo = F(ip*(Q, OIV'oo)- Finally using Young's inequality: Ve > 0, Va, b e 
ab < so? + j^b 2 , we obtain 



^ < -2(p-Rh)E m + 2pE c + ±F QC - s * 2t , 
dt v ' 4e 



(36) 



where e > will be chosen optimally later in the proof. We are left to control the channel entropy 
term E r in order to conclude. 



3.4 Controlling E c 

The aim of this section is to obtain a control on the evolution of E c defined by (|38|) . and more 
precisely an upper bound of E c , denoted P, which is the weighted x 2 -distance between ip^' 1 and 
V4 7 > see @ and (gSJ below. 
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Recall that E c = Jj H(/j 1 L x \fi I , x )4 dx, where the integrand is the relative entropy of a 
Bernoulli measure. Poincare and logarithmic Sobolev inequalities have been studied for Bernoulli 
measures pp. In order to obtain an exponentially decaying relative entropy, however, a suitable 
semi-group and its infinitesimal generator is needed for the measure. In the case of the bi-channcl 
model (in particular [HI]), we face problems due to the region £ C T, where no exchange is per- 
mitted between the two channels: in this region, the speed at which the measure [iL x reaches 
equilibrium cannot directly be controlled. To circumvent this issue, we consider the spectral gap 
of an adequate operator and resort to the Poincare inequality. 

By the definition of E c and using the inequality Vx > 0, xln(x) < x(x — 1) and the fact that 

Si=o Jir^' 7 ^' x i *) d% = 1; wc obtain 



* - ±LA%m* 



i=0 
1 



l&I 



V' ? / Vo 



< 



£ / In {^/m 4 J E M 

E/f^TT" 1 ) ^dx-E M . (37) 

i=0 Jt Vvoi / 

E c < P, (38) 

P= £/(SH^ (39) 

In order to proceed and consider the time derivative of P, we will need some further results to 
express the evolution of the marginal density 4 . The idea is to compare the evolution of tjfi' 1 
with the dynamics of this density if A' t and L d x Vdfi t i xi were already at equilibrium (see Section 



We therefore have 
where 



Lemma 10 The Fokker-Planck equation (|15[) for 4' 1 can be rewritten as 



+d x ((A' ~ A' t )4 J ) _ \( X )^J _ ^.) 
Proof : First we will show that 

d t ^ J = d x (J Voo^ (j-\ dy\ + d x ((A' - A'A^ 1 ) - \(x)(4 J - 4-d- (41) 

By developing the right hand side, we have 

d t ^ J = d x ( f d x *p - -^c^oo) +d x f (A' - A' t )^ - X(x)(^ J - 4'^) 

= d x ( f d x (Vi - A o £)</>) + d xx ^ J + d x f {A' - A' t )^ - \{x){4^ - vfi) 

d x (d x (Vi - A t o £)V0 + d xx ^ - \{x){4 a - 4'^) 



which is indeed the Fokker-Planck equation (fT5|) associated to tp^' 1 . 
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Next, we show that Vx E T, Vz € {0, 1} 



*-°' ( £) dy = ^ a - (i? 






4> 



u 



1> 



s.i 



(42) 



To prove the above, notice that 



ipood x [ -j— ] dy - ipiod x 



■il'o 



'</' 



d x ip dy 



t,i 






dx^oo - d x ^ J + —rjdxip'L 1 



(d x (Vi-Ao^)^dy 



■n 



d x (Vi-Aot)e-M- A °Vdy 



-(Vi-Aot;) 



-1> 



Z-l 



dy 



dxV.e-^-^ dy 
{dxVi)^ dy - A\x)^ J - ^_ tfJ + A'(x)^ J 



-(Vi-Aof) 



dy 



Finally, by using the fact that the free energy A is independent of y, we obtain 



^ood*(V>M>o) dy - il&dxW^/^J) 



f m d x Viip J m dxViip c 



V 



S,i 



■ii- 



i.i 



as required. The final result (|4"0)l is obtained by substituting (|42|l into (|4T|) . 



(43) 







Notice that (|40|) is comparable to (|25j) . only with additional terms due to the fact that A t and 
L d x Vd(i t \x.i have not yet converged. The difference of the biasing force and mean force, A' t — A' , 
was already estimated in Lemmas [5] and El We are therefore left to control the remaining term. 



Lemma 11 Vx € T, Vi € {0, 1}, 



tfj 



1> 



€,i 



< MJ-H (fl t \x,i \fioo\x,i)- 



As a consequence, 



E 



J M d x Vitp j^dxVtipo 



^ 



t 



i-i 



4> u dx < E m . 

P 



(44) 



Proof : Let Ti(fJ-t\x,ii ^oc\x,i) be the set of coupling measures ontxi with marginals fit\x,i and 
fJ-oc\x.i respectively and let 7r E II . Then 



I (dxV^ - dxV^ 



{d x Vi(x,y) - d x Vi(x,y')) ir(dy,dy') 

K 

||0a;,i/Vi|| ioo / |y-y'l n{dy,dy') 



< MW-fl'(/Xt| x ,i|/ioo|a;,i) 



where we have used Lemma[T]since fJ-oo\x.i satisfies LSI(p). Equation (|44p follows immediately from 
the fact that £) i=0 / T #(A i t|z,ilA t oo|:M)V' 4 ' / dx < E m . 
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One result that is now needed to derive estimates on the evolution of P is the existence of a 

spectral gap of the operator describing the dynamics once A' t and / d x Vid[i t \ xi have converged 

Jm 
(see ([25])). We now justify the existence of such a spectral gap. 

In order to do so, let us define the vector spaces 



Vi = iv : T x {0, 1} 

and 



V* G {0, 1}, J' G L 2 (T, l&ix, i) dx),J2 [ «i(*) dx = l \ 

v™ \x, i) i=0 Jt j 



ViG{0,l}, W ' - €H 1 (T,^ I (x,i)dx), V f Wi (x)dx = l\ 

V& (x, i) l=0 Jt J 



A function <f> in Vi (or in W;) will also be considered as a vector valued function as <ft : 



T^R 2 

x ^ ((j) (x),(t) 1 (x)) 



Notice that tj> G Wi if and only if / := - V^ 7 G Wo- 

Lemma 12 Recall the operator L = (Lq,Li), with Li defined as in (|25|) . 

TTien 

zj TTie operator L is symmetric and positive definite with respect to the inner product 

1 f 1 

(f,9) = E / M x )9i{x) -£j- — - dx. 

4=0 Voc {X, %) 

ii) L has a spectral gap 6 > in the sense that 

inf %f = 9>0. 



(45) 



Proof : i) To show symmetry of the operator L, consider functions ip,<f> G Wo- Now, using 
the fact that Vx G T \ £, tp^ (x) = ^^(x), 



1 f 1 



U^HwM^i 



\{x)(pi{<j)i - 4>i-i)—rj- dx 



= E / d - ( ifr ) 9 * ( -71J- J ^ dx+ f \(x)(<po - cpx)(<f>o - <h)-^r dx ( 46 ) 

i=0 , ' T \Voo,4/ \roo,i/ ■ Jt ^00,0 

= - V / 04<9 K ( ^i^ ( x^ - ) ) TTT da; + / A O)(0o - 0i)(vo - <Pi)-rr dx 

1 /" 1 

= E / fiiLnp—rj- dx. 
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From (|46|) , we conclude positive definiteness of L 

1 



i=0 Ji Voo,i to 1 " \™°o,' 



^Ji dx + / A (^)(^o - <l>i) 2 -rr dx > o- ( 47 ) 



Notice that the above is strictly positive for any <\> £ Wo, since (cf>,L(f>) = if and only if 
0o = 0i = 0. 

iij In fact, one can check that 3k > 0, such that V0 € Wo, <t> 7^ 0, 



£ / ^Li^-iy dx^K^f (|&| 2 + |V0,| 2 ) 4t 



Therefore, by the Lax-Milgram theorem, L _1 is well defined from Vo to Wo and thus compact 
from Vo to Vo- From the symmetry and positive definiteness of L, and the fact that its inverse is a 
compact operator from Vo to Vo, it has a strictly positive and discrete spectrum. There exists a set 
of eigenvectors (v n ) n >i, orthonormal with respect to the inner product (•,•), forming a basis of Vo 
and Wo, and associated to an increasing sequence of eigenvalues (c n )n>i> such that lim o n = oo. 

In particular, there exists a spectral gap: 6 = o\ > 0. <0> 

Remark 4 In the case where a function <p £ Wi satisfies d t <j) = L<f>, a consequence of Lemma XH\ 

AS' 

Vt>0, U(t,-)-^J\\ 2 <Ke- 29t , (48) 

where j • || = (•,•) and K = } y (0(O; •) ~ ip^o i v n) ■ This is easily obtained by noticing that 

n>l 

(j> — ip^ £ Wo and therefore can be expressed in terms of the orthonormal eigenvectors (v n ) n >\ 

<t>(t, •) - V& J = £>(0, ■) - ^-» n Ke-'»' 

The result (|48|) follows immediately since Vn > 2, a n > o~\ = 9. 

Notice in particular that the Fokker-Planck equation (|40j) satisfied by %jfi> can be written as 

d t ^(t, x, i) = L^J + dx (J^jjW - /a ^°° ) ^' J ) + 9 X ((^' - A'M U - 

We will show that the last two terms can be controlled by E m , see (|49D . 

With these tools at hand, let us consider the time evolution of the functional P defined in (pi5|) 
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Using equation (j40|) of Lemma [TU1 we get 



^f - tj/ J ^ J w dI 



i=0 



<9z 






1 /• i 

^ dx-J2 KxW^i 1 - 4- l )^r dx 



tJAw>'-^'^tMw 



4 \ fL d xVip J R d x Vtlj c 



^«. 



ipic 



^ dx. 



iJ _„/,«>' 



Notice that, by developing the sum and using the fact that ip^, = tp^ 1 for X(x) ^ 0, the second 



term may be replaced by / X(x) 



44 - 44 



2 i 



, . dx. Finally by using Young's inequality on 
the last two terms, we obtain for a parameter a > to be chosen later on, 



ldP 
2~dt 



E 



_ n JT 



4=0 



d x 



r 



■'/•: 



u 



44 dx- I X(x) 



44 - 4/ 



2 1 



^oo,0 



dx 



7-Y, I & (4?) 4' 1 dx + a [ \A' - A' t \ 2 ^ dx 

JLy [ d (4\ 2 Udx j^i 



J K d x ViP f R d x Vip 



V'oo 1 



^' ! dx. 



Next, by Lemmas l6l and ITTT 



ldP _y. 



i=0 



iph 



i>^ dx - / X(x) 



44 ~ 44 



2 i 



■</• 



u 



dx 



-t 



i=Q J 1 



<-').,: 



ipso 



M 2 



ip u dx + 2a E m + 2aR 2 E r) 

P 



Notice that, in the third term, ip*' 1 < ^ < M for M = ||^ ? (0, -)\\ Loo and 1 < ip^/c for 
c = min ip^ . This gives 



ldP _y 

9 rlt - L^l 



i=0 

M ^ 

2ac f-J j- 

i=0 ,/1 



Ipoo 



tfg dx - / X(x) 



44 - 44 



2 i 



■</• 



u 



dx 



d x 



ipt 



t/4 7 dx + AaR 2 E n 



Finally, by grouping terms together and using the fact that a may be chosen such that M/2ac < 1 
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(an appropriate choice for a is given later in the proof), we have 



ldP 

2~dt 



< 



< 



9n>r I ^ 



< - 1- 



2ac 

M " 
2ac 

M " 
2ac 

M S 



i=0 
1 



E 

i=Q 

1 



d x 



^ 
^ 

(/'" 



V4 7 dx - / \{x) 



4 J - 4' 1 



2 1 



il> 



e,i 



dx + AaR 2 E„ 



t 



t,i 



V& 1 dx + / A(x) 



4' 1 - 4 J 



2 i 







u 



dx 



AaR 2 E„ 



V I (ip^Utp^—fj dx + AaR 2 E, 
l=Q h V'Si 



-^ l#P + 4ai? 2 £ m , 



(49) 



where the last line is a result of ([4"5"j) . with / := ^ ,J — -0I/- Notice that / € Wo since the 



normalization for ip^' 1 is N / ip^' dx = 1 

i=o ^ 



To complete the proof of Theorem [1] we now need to study the system of inequalities ([36 
and (Bat, 



3.5 Completing the proof 

To show that -E m decays exponentially fast, we study the system of two inequalities (|36|) and ((49 
Since, from (f3"5|) . I? c < P, the system to be studied is 



^ < -2 (p - i? 2 e) £ m + 2pP + ^F e" 



(50) 



The parameters a > ^ and e > remain to be chosen in order to obtain an exponential con- 



vergence with the best possible rate. To fix a, let us first study the eigenvalues of the matrix of 
coefficients, neglecting the terms in s. 



Lemma 13 Let us assume [H4J- The matrix 

-P 



A = 



P 



AaR 2 



1 



M 

2(w 



is negative definite and a may be chosen so that the eigenvalues — X± of A are such that 

-A_ < -A+ = -A(0) < 



where A 



i, oo) — > (0,p) is a positive, increasing function. We recall that 



MIR 2 



where R = C + Mp l ' 2 . The function A is such that A — >■ as 6 — > # m in and A — > p as 6 — > oo. 
Moreover, A(p + 26 min ) = ?-. 

Proof : In order to prove the negative definitcness of the matrix A, we show that for certain values 
of a > 0, tr(-A) < and det(A) > 0. In the following, we only consider positive values of a (which 
is imposed by the previous computations). We have 



ti(A) =- p -(l- — ]e<0 iff a> M6 

V ' ' \ 2ac) 2c(p + 



(51) 
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and 

, /n / l\ , „ , , 0c ± V0 2 c 2 -8AI8R 2 c , s 

detM =0/9 1 4ai? 2 p > iff a e (a_ ,<*+), a± = = . (52) 

\ 2ac I 8R 2 c 

The interval (a-,a + ) is indeed well denned and included in [0,oo) since > m ; n = 8MR 2 /c 
(hypothesis [H4]). We seek an optimal a that minimizes eigenvalue — A+ and satisfies (fSTj) and ((521) . 
An analytical solution cannot be easily obtained. We choose 

* M 

a = a := — , 
c 

which appears to be very close to the optimal choice, from numerical computations. Notice that a* 
satisfies flST) and fljjS) since, for a = a*, tv(A) = -p-6/2 < and det(A) = 9p/2-4R 2 Mp/c > 0. 
The eigenvalues of the matrix are now given by 



-^(-HWH)'^) <- 

The rate of convergence of the system is given by the largest of the two eigenvalues — A + . Let us 
introduce the function 

*>~;(-HWH)" + ^) 

such that A+ = A(0). It is easily shown that A is an increasing function of with 

f n OD «_> 8MB 2 
A(0) I as 0->-e-. 

[ p as — > oo. 

Moreover, it is easy to check that A(p + 20 m ; n ) = — , which concludes the proof. ^> 

We are now in position to complete the proof of Thcorcm[TJ Let us define Y(t) = (E m (t), P(t)). 
Using ([50f the fact that E m < \\Y\\2 (where ||F||2 denotes the Euclidean norm of the two- 
dimensional vector Y), we obtain 

5>i* = ii^ +p2 > 



2Y T AY + 2R 2 eE 2 + — F e~ 87r '£„ 



1 
'o' 



and as a result, 



4e 
< -2A(0) ||Y|| 2 + 2R 2 e\\Y\^ + Ipoe-^WYh, 



d \\Y\\2 ^ 0/A ^ D 2„M^„ 1 v.-to't 



< -2(A(6)-R 2 e)\\Y\\ 2 + -F e-^- t . (53) 

at 4e 

For arbitrary small e > 0, let us consider X e = A(0) — R 2 e < A(0). We may assume without loss 
of generality that A e ^ At: 2 . Then, from ([5U]) . one gets: 

E m < \\Y\\ 2 < K e e~ 2min{KA7r2}t , (54) 

where 

K £ = 2 max j V^, 2 „(0) + P 2 (0), ■ 



Fn 



8e|A e -47T 2 | 
2G 



which concludes the proof of (|27|) . 

The exponential convergence of the total entropy E results from the relation E = Em + E m , 
(|54|) and Lemmas [7] and [51 The Csiszar-Kullback inequality implies the same for \\ip(t, •) — V't»||x,i- 

Finally, the convergence results on A' t are easily obtained from Lemma [5] and the fact that ip* is 
bounded from below by a positive constant for times larger than an arbitrary small positive time, 
see the beginning of Section 3.3.2 in [14] for more details. 
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